Enriching the "Senso Comune" Platform with Automatically Acquired Data
Authors
Abstract
This paper reports on research into automatic methods for enriching the Senso Comune platform. At this stage of development, we report on two tasks: word sense alignment with MultiWordNet, and automatic acquisition of Verb Shallow Frames from sense-annotated data in the MultiSemCor corpus. The results are satisfactory: we achieved a final F-measure of 0.64 for noun sense alignment, an F-measure of 0.47 for verb sense alignment, and an accuracy of 68% on the acquisition of Verb...
Similar resources
From Glosses to Qualia: Qualia Extraction from Senso Comune
This paper describes a case study on methods for automatically extracting qualia relations from dictionary glosses in Italian, namely the Senso Comune De Mauro Dictionary (SCDM). The qualia extraction has been addressed by means of a pattern-based approach and lexical match with an Italian generative lexicon based language resource, PAROLE-SIMPLECLIPS (PSC). The evaluation of the extraction app...
Automatic Domain Assignment for Word Sense Alignment
This paper reports on the development of a hybrid and simple method based on a machine learning classifier (Naive Bayes), Word Sense Disambiguation and rules, for the automatic assignment of WordNet Domains to nominal entries of a lexicographic dictionary, the Senso Comune De Mauro Lexicon. The system obtained an F1 score of 0.58, with a Precision of 0.70. We further used the automatically assi...
Lexicon and Ontology Interplay in Senso Comune
Following a fashionable recent trend in the scientific community, computational lexicons are often said to incorporate or even correspond to linguistic ontologies, whose purpose is to describe semantic constructs of language (bound to grammatical units). Nevertheless there’s a big debate on whether the categorial structures of computational lexicons could be acknowledged as ontologies or not. W...
Aligning an Italian WordNet with a Lexicographic Dictionary: Coping with limited data
This work describes the evaluations of two approaches, Lexical Matching and Sense Similarity, for word sense alignment between MultiWordNet and a lexicographic dictionary, Senso Comune De Mauro, when having few sense descriptions (MultiWordNet) and no structure over senses (Senso Comune De Mauro). The results obtained from the merging of the two approaches are satisfying, with F1 values of 0.47...
Senso Comune, an Open Knowledge Base for Italian
Senso Comune is an open knowledge base for the Italian language, available through a Web-based collaborative platform, whose construction is in progress. The resource integrates dictionary data coming from both users and legacy resources with an ontological backbone, which provides foundations for a formal characterization of lexical semantic structures (frames). A nucleus of basic Italian lemm...